Storage - Content Validation Decoder Implementation by gunjansingh-msft · Pull Request #47016 · Azure/azure-sdk-for-java

gunjansingh-msft · 2025-10-15T13:22:26Z

No description provided.

github-actions · 2025-10-15T13:30:18Z

API Change Check

APIView identified API level changes in this PR and created the following API reviews

com.azure:azure-storage-blob
com.azure:azure-storage-common

kyleknap

It's looking good! It's great to see us getting a working end to implementation of downloads. I just left some higher-level comments as we keeping hashing out the download implementation.

ibrandes

i will be 100% honest - i was really struggling to walk through the logic in here, there's so much to look at x) my comments should highlight some of the more confusing areas. don't worry about polishing anything too much, but adding more subfunctions, centralizing util logic, and reducing the nesting should make this a lot easier to follow.

overall, i think you're going down the right path with how you utilize and set the decoder state. also, definitely look into downloadToFileImpl as mentioned in one of my comments - ProgressListener and ProgressReporter might be able to help us reduce some of the boilerplate code you've had to write.

github-actions · 2026-03-13T05:24:14Z

Hi @gunjansingh-msft. Thank you for your interest in helping to improve the Azure SDK experience and for your contribution. We've noticed that there hasn't been recent engagement on this pull request. If this is still an active work stream, please let us know by pushing some changes or leaving a comment. Otherwise, we'll close this out in 7 days.

ibrandes

looking great so far!

kyleknap

It's looking good. I like the overall structure of where the change is going. I have not dived too deep into the PR yet but just wanted to post some of the questions/comments that I had so far as I work through understanding the changes.

gunjansingh-msft · 2026-04-17T05:11:49Z

looking great so far!

All the latest comments are addressed

…pipelinepolicy Resolve conflicts in BuilderHelper (both decoder and encoder content validation policies) and StorageCrc64Calculator Javadoc. Align download path with ContentValidationAlgorithm rename; fix downloadStreamWithResponseInternal variable mix-up. Delegate CRC64/AUTO detection to ContentValidationModeResolver.isCrc64OrAuto from DownloadValidationUtils. Update decoder tests for setContentValidationAlgorithm; remove MD5 download test (enum no longer includes MD5). Fix incorrect @link in ContentValidationModeResolver Javadoc. Made-with: Cursor

ibrandes

looking good so far! mostly just had a couple comments on util cleanup and code consolidation - your decoder lifecycle handling looks good to me

kyleknap

It's looking good. Just left some more comments. I also agreed with all of the feedback that @ibrandes left too.

ibrandes · 2026-04-22T17:56:47Z

+            StructuredMessageDecoder decoder = new StructuredMessageDecoder(expectedLength);
+
+            Flux<ByteBuffer> decodedStream = decodeStream(httpResponse.getBody(), decoder);
+            return new DecodedResponse(httpResponse, decodedStream);


here, we are decoding the structured body into a payload body, but we are still wrapping the response with the original Content-Length header. we should add an override to DecodedResponse for the getHeaders method where we remove the Content-Length header:

@Override public HttpHeaders getHeaders() { HttpHeaders headers = new HttpHeaders(originalResponse.getHeaders()); headers.remove(HttpHeaderName.CONTENT_LENGTH); return headers; }

another option is to use the adjusted Content-Length value, but I'm not sure if we know that until everything is consumed.

This is done

@ibrandes and @gunjansingh-msft I'm not sure if I'm following why we'd remove the Content-Length header all together? My main concern is that using the Content-Length header is a common way to derive the size of blob that is being downloaded and downstream logic might rely on it being present? Might be interesting to check with what .NET does here as well to make sure we are consistent. I do generally agree that it makes sense to override the Content-Length with the value of the x-ms-structured-content-length in order to not break existing flows that leverage content length as the blob size.

I’ve aligned this with the .NET behavior in the latest revision.

In the .NET SDK, the response body is wrapped via StructuredMessageDecodingStream.WrapStream, but the response headers (including Content-Length) are not modified and continue to reflect the wire payload size.

I’ve removed the Content-Length override here as well, so the Java implementation now matches the .NET and by passing headers through unchanged.

Callers can still get the decoded length via x-ms-structured-content-length, BlobDownloadHeaders.getBlobSize(), or Content-Range if needed.

adding this as an ADO item to further investigate.

kyleknap

Looking good. Just leaving some more questions that I had as I'm working through fully understanding the implementation. Mainly focused on the policy and the decoded response class. Plan to dig more into the structured message decoder later.

kyleknap · 2026-04-24T21:18:46Z

+            StructuredMessageDecoder decoder = new StructuredMessageDecoder(expectedLength);
+
+            Flux<ByteBuffer> decodedStream = decodeStream(httpResponse.getBody(), decoder);
+            return new DecodedResponse(httpResponse, decodedStream);


@ibrandes and @gunjansingh-msft I'm not sure if I'm following why we'd remove the Content-Length header all together? My main concern is that using the Content-Length header is a common way to derive the size of blob that is being downloaded and downstream logic might rely on it being present? Might be interesting to check with what .NET does here as well to make sure we are consistent. I do generally agree that it makes sense to override the Content-Length with the value of the x-ms-structured-content-length in order to not break existing flows that leverage content length as the blob size.

kyleknap

We should be good to merge it into the feature branch as our base point. Just had one question that we can track as a follow up. From there, we can start sending smaller PR improvements to help break up the different iterations we wanted to make and make it easier to track/follow

kyleknap · 2026-04-29T21:44:44Z

+    }
+
+    @Override
+    public void close() {


Were we planning to remove this method per this comment? #47016 (comment) Mainly asking, fine to defer it for a follow up task like we are doing for the content length.

yep - thanks for catching this! looks like i accidentally reverted the removal :,)

gunjansingh-msft requested review from alzimmermsft, browndav-msft, ibrandes, kyleknap and seanmcc-msft as code owners October 15, 2025 13:22

github-actions Bot added the Storage Storage Service (Queues, Blobs, Files) label Oct 15, 2025

kyleknap reviewed Nov 19, 2025

View reviewed changes

gunjansingh-msft requested a review from a team as a code owner December 3, 2025 15:54

ibrandes reviewed Jan 12, 2026

View reviewed changes

github-actions Bot added no-recent-activity There has been no recent activity on this issue. and removed no-recent-activity There has been no recent activity on this issue. labels Mar 13, 2026

gunjansingh-msft requested review from a team, JonathanGiles, XiaofeiCao, benbp, danieljurek, g2vinay, haolingdong-msft, joshfree, kirankumarkolli, maririos, praveenkuttappan, raych1, samvaity, srnagar, weidongxu-microsoft and weshaggard as code owners March 27, 2026 08:15

ibrandes reviewed Apr 5, 2026

View reviewed changes

Comment thread ...n/java/com/azure/storage/common/implementation/contentvalidation/StorageCrc64Calculator.java Outdated

ibrandes reviewed Apr 6, 2026

View reviewed changes

Comment thread ...rage/azure-storage-blob/src/main/java/com/azure/storage/blob/specialized/BlobClientBase.java Outdated

ibrandes reviewed Apr 6, 2026

View reviewed changes

Comment thread ...n/java/com/azure/storage/common/implementation/contentvalidation/StorageCrc64Calculator.java

ibrandes reviewed Apr 6, 2026

View reviewed changes

Comment thread ...m/azure/storage/common/implementation/contentvalidation/StructuredMessageDecodingStream.java Outdated

gunjansingh-msft added 3 commits April 8, 2026 16:38

code refactoring based on latest review comments

11a2da2

simplifying retry mechanism

c907649

removing dead code

5b153a4

ibrandes reviewed Apr 8, 2026

View reviewed changes

kyleknap reviewed Apr 14, 2026

View reviewed changes

gunjansingh-msft added 2 commits April 17, 2026 10:11

addressing Kyle's review comments

f72cb8b

addressing latest review comments

20e3303

ibrandes reviewed Apr 21, 2026

View reviewed changes

kyleknap reviewed Apr 21, 2026

View reviewed changes

gunjansingh-msft added 2 commits April 22, 2026 19:30

refactoring based on latest review comments

f9befff

refactoring based on latest review comments

b3298c2

ibrandes reviewed Apr 22, 2026

View reviewed changes

refactoring based on latest review comments

ed8c3eb

kyleknap reviewed Apr 24, 2026

View reviewed changes

Comment thread ...azure-storage-blob/src/test/java/com/azure/storage/blob/BlobMessageDecoderDownloadTests.java Outdated

gunjansingh-msft and others added 8 commits April 28, 2026 20:40

refactoring based on latest review comments from kyle

d7b0c51

expanding test coverage

73f22e9

removing unused imports

c5f3a8c

recordings

2d5c9b8

adding documentation to decoder classes

c1a3f1f

small fixes and failure path tests

3c77a76

addressing context comment

aaa6414

analyze error

e08eaba

kyleknap approved these changes Apr 29, 2026

View reviewed changes

Conversation

gunjansingh-msft commented Oct 15, 2025 • edited by ibrandes Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

API Change Check

Uh oh!

kyleknap left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ibrandes left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented Mar 13, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ibrandes left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kyleknap left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gunjansingh-msft commented Apr 17, 2026

Uh oh!

ibrandes left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kyleknap left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gunjansingh-msft commented Oct 15, 2025 •

edited by ibrandes

Loading

github-actions Bot commented Oct 15, 2025 •

edited

Loading